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Abstract  —  We  consider  the  problem  of  distributed 
test  of  statistical  independence  under  communication 
constraints.  While  independence  test  is  frequently  en¬ 
countered  in  various  applications,  distributed  indepen¬ 
dence  test  is  particularly  useful  for  events  detection  in 
sensor  networks:  data  correlation  often  occurs  among 
sensor  observations  in  the  presence  of  a  target.  Focus¬ 
ing  on  the  Gaussian  case  because  of  its  tract  ability,  we 
study  in  this  paper  the  characteristics  of  optimal  scalar 
quantizers  for  distributed  test  of  independence  where  the 
optimality  is  in  the  sense  of  optimizing  the  error  expo¬ 
nent.  We  also  discuss  the  optimal  quantizer  properties 
for  the  finite  sample  regime,  i.e.,  that  of  directly  mini¬ 
mizing  the  error  probability. 

Keywords:  Distributed  signal  processing,  test  of  in¬ 
dependence,  sensor  networks. 

1  Introduction 

Test  of  statistical  independence  has  been  a  classcal 
inference  problem  [1]  and  has  found  a  wide  range  of  ap¬ 
plications,  e.g.,  in  image  processing  [2],  economics  [3]. 
The  emerging  wireless  sensor  networks  bring  new  di¬ 
mensions  and  challenges  to  this  classical  problem  as 
the  data  are  no  longer  centrally  available.  Dependence 
detection  in  distributed  systems  is  often  the  first  and 
crucial  step  in  event  detection/identification;  thus  its 
relevance  in  various  sensor  network  applications  is  quite 
evident.  One  particular  example,  which  will  be  used 
later  is  cooperative  spectrum  sensing  in  cognitive  radio 
network:  the  presence  of  the  primary  user’s  signal  in¬ 
troduces  dependence  among  the  decentralized  spectrum 
sensors. 

Take,  for  example,  the  Gaussian  case  and  consider 
the  following  hypothesis  testing  problem:  a  pair  of  ran¬ 
dom  sequences  (Xj,Y),  i  =  1,  •••,«.,  with  (Aj,Y)  in¬ 
dependent  and  identically  distributed  (i.i.d.)  according 
to  the  joint  probability  density  function 

fx,r(x,y )  =  - — 7=^exp(-— -^—j-(x2-2pxy+y2 
2ttV/1-p2  V  2(1  -  pz) 


The  two  hypothese  under  test  are 


(H0:  0 

1  H{  :  p  =  0 


(1) 


i.e.,  (X,  Y)  is  bivariate  Gaussian  and  they  are  indepen¬ 
dent  under  H\  and  dependent  under  Hq.  Notice  that 
assuming  zero  mean  and  unit  variance  does  not  lose 
any  generality  as  long  as  the  mean  values  and  variances 
are  known.  In  the  centralized  case  where  X  and  Y  se¬ 
quences  are  available,  this  statistical  inference  problem 
can  be  solved  straightforwardly  by  applying  some  stan¬ 
dard  statistical  inference  frameworks  depending  on  the 
situations  (e.g.,  whether  or  not  p  is  known  under  Hq 
)  [4]  - 

The  problem  becomes  much  more  interesting  and 
complicated  when  X  and  Y  are  not  directly  available; 
instead,  compressed  versions  of  X  and  Y  subject  to 
some  rate  constraints  are  used  for  the  test  of  indepen¬ 
dence.  This  distributed  test  of  independence  is  the  fo¬ 
cus  of  the  present  paper.  To  be  more  specific,  we  as¬ 
sume  that  X  and  Y  are  available  respectively  at  two 
distributed  sensor  nodes.  The  sensor  nodes  communi¬ 
cate  their  data  to  the  fusion  center  under  a  communica¬ 
tion  constraint  of  R\  and  R2  bits  per  observation.  The 
fusion  center,  upon  receiving  the  sensor  data,  makes 
a  final  decision  on  whether  X  and  Y  are  correlated  or 
not.  Our  attempt  is  to  understand  properties  of  optimal 
quantizers  at  distributed  nodes  where  the  optimality  is 
associated  with  the  performance  at  the  fusion  center 
with  regard  to  the  dependence  test. 

Consider  first  the  large  sample  regime,  i.e.,  n  is  large. 
Given  that  (Xl,  Y)  form  an  i.i.d.  sequence,  it  is  easy 
to  show  that  any  reasonable  quantizers  will  lead  to  a 
test  with  diminishing  error  probability  as  n  grows  for 
Ri  >  0  and  R2  >  0.  Thus  a  sensible  criterion  is  the 
speed  with  which  the  error  probability  approaches  zero, 
i.e.,  the  error  exponent  characterization.  This  is  indeed 
the  underlying  reason  for  the  problem  setting  where  the 
mull  hypothesis  Hq  represents  dependence  while  inde- 
'pendence  occurs  under  Hi.  Applying  Stein’s  lemma  [5] 
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to  the  hypothesis  testing  problem  (1),  for  a  given  type 

1  error  constraint,  the  error  exponent  for  the  type  II 

error  (i.e. ,  the  Kullback  Leibler  distance  between  the 
distributions  under  Hq  and  Hi)  reduces  to  the  mutual 
information  between  suitable  random  variables.  For  ex¬ 
ample,  with  centralized  test,  the  optimal  error  expo¬ 
nent  becomes  I(X;Y).  Our  focus  in  the  large  sample 
regime  is  to  study  quantizer  properties  in  the  context 
of  distributed  test  against  independence  with  Gaussian 
sources.  Motivated  by  practical  constraints  that  of¬ 
ten  require  simple  sensor  processing,  we  consider  only 
scalar  quantizers  at  local  sensors  with  1  bit  per  observa¬ 
tion.  That  is,  =  1  and  each  sensor  quantizer  is 

‘memoryless’.  Our  objective  will  be  therefore  to  deter¬ 
mine  the  optimal  scalar  quantizer  structure  that  maxi¬ 
mizes  I(U,  V)  where  U  and  V  are  the  one  bit  quantizer 
output  for  the  two  sensors. 

Characterizing  optimal  error  exponents  for  depen¬ 
dence  test  with  communication  constraints  was  first 
considered  by  Ahlswede  and  Csiszar  [6].  In  particular, 
for  the  special  case  of  test  of  independence  problem  with 
one  sided  data  compression,  i.e.,  i?2  =  oo,  a  single  letter 
characterization  of  the  optimal  error  exponent  was  ob¬ 
tained  in  [6] .  An  overview  of  related  work  can  be  found 
in  [7]  and  the  references  therein.  We  note  here  that  the 
majority  of  the  reported  work  are  largely  restricted  to 
(A,  Y)  being  discrete  memoryless  sources.  Distributed 
test  of  independence  with  continuous  alphabet  sources 
(e.g.,  Gaussian  sources)  have  been  much  less  investi¬ 
gated. 

We  will  also  study  distributed  test  of  independence 
in  the  finite  sample  regime,  that  is,  we  attempt  to  char¬ 
acterize  properties  of  quantizers  that  directly  minimize 
error  probability  at  the  fusion  center. 

The  rest  of  the  paper  is  organized  as  follows.  In  Sec¬ 
tion  2,  we  give  the  problem  statement  and  our  main 
results.  Section  3  are  numerical  examples.  At  last,  we 
conclude  in  section  4. 

2  Problem  Statement  and  Main 
Results 

2.1  Large  sample  regime 

Consider  the  hypothesis  test  described  in  (1).  The 
fusion  center  does  not  have  direct  access  to  the  source 
sequence  (Xj,  1)),  i  =  1,2 ,  •  •  • ,  n,  but  can  be  informed 
about  the  sources  only  at  limited  rates.  Precisely,  the 
local  sensors  apply  scalar  quantizers  to  their  respective 
observations: 

Ut  =  7i  (Xi) 

V  =  72  CL) 
where  Ut  and  V)  £  {0, 1}. 

For  the  large  sample  regime,  the  fusion  center  will  de¬ 
cide  Hq  or  Hi  given  the  sequence  ( Ui ,  Vi)  i  =  1,  •  •  • ,  n 
and  we  are  to  characterize  the  optimal  quantizers  that 


maximize  the  error  exponent.  Using  the  Neyman- 
Pearson  criterion,  we  assume  that  the  rejection  region 
is  the  set  B  C  Xn  whose  complement  of  B  is  B.  The 
minimum  probability  of  type  II  error  for  a  prescribed 
arbitrary  small  probability  of  type  I  error,  denoted  by 
Pri.  fl2(n,  e),  is  defined  as 

e)  =  min{Qn(B)\B  C  Xn ,  Pn(B)  <  e}  (2) 

B 

The  error  exponent  associated  with  /3r 1,R2(n,e)  is,  un¬ 
der  the  problem  setup,  the  mutual  information  between 
U  and  V,  I(U,  V).  Our  problem  becomes  finding  a  pair 
of  binary  quantizers  such  that  I(U,  V)  is  maximized. 

By  restricting  each  sensor  to  a  one  bit  scalar  quan¬ 
tizer,  we  have  the  following  result. 

Theorem  1  For  the  distributed  test  of  independence 
problem  described  in  (1)  where  each  local  quantizer  is 
restricted  to  be  one  bit  scalar  quantizer  with  a  single 
threshold,  the  optimal  quantizers  that  maximize  the  er¬ 
ror  exponent  are  a  sign  detector,  i.e.,  a  binary  quantizer 
with  threshold 

ti  =  t2  =  0  (3) 

While  the  result  is  rather  intuitive  with  the  symmet¬ 
ric  problem  setting,  the  proof  is  rather  lengthy  and  is 
sketched  in  the  Appendix.  Notice  that  the  result  relies 
on  the  assumption  of  a  single  threshold  quantizer:  it  is 
not  known  if  such  restriction  may  be  relaxed  though  it 
appears  to  be  the  case  from  extensive  numerical  exam¬ 
ples. 

2.2  Finite  sample  regime 

For  the  finite  sample  regime,  we  consider  a  Bayesian 
approach  where  the  priors  for  the  two  hypotheses  are 
assumed  to  be  7Tq  and  tti  respectively.  We  derive  quan¬ 
tizer  properties  for  minimum  error  probability  with 
both  two-sided  and  one-sided  compression,  with  the  lat¬ 
ter  refering  to  the  situation  in  which  the  fusion  center 
has  full  data  from  one  sensor  while  compressed  data 
from  another.  This  situation  arises  naturally  in  the 
case  where  one  of  the  sensors  is  tasked  with  the  final 
decision  making. 

For  the  finite  sample  regime,  we  adopt  the  person-by¬ 
person  optimal  approach  and  obtain  the  following  result 
for  two-sided  compression,  following  standard  approach 
described  in  [8]. 

Proposition  1  For  the  distributed  testing  of  indepen¬ 
dence  problem  with  one  bit  quantization  defined  above. 
If  we  further  assume  the  fusion  rule  satisfies, 

P{Uo  =  L\U  =  1,V  =  j)  >  P(Uo  =  l\U  =  0,V  =  j) 
P(U0  =  0\U  =  0,V  =  j)  >  P(U0  =  0\U  =  1,V  =  j) 


1 

Ai  =  Y\p(uo  =  MUi  =  l,Ui  =  j)- 

3=0 

1 

Bi  =  J2[P(Uo  =  0\Ul  =  01U1=j)- 

3=0 

for  all  j  =  {0, 1},  then  the  optimal  local  decision  rule 
at  ith  sensor  is  given  by: 


f  BiP(xj\xiHi)dxj 
J  X-j 


p(Ui  =  11*0  =  ' 

J  1 

Af  Jxi  ^  iro 

J  f^_AiP(x-i\xiH0)dx-i  -  *-1  (6) 

l  0 

otherwise 

where  i  =  3  —  i, 

for  i 

=  1,2,  (hence,  aq  =  Y),  7Tq  = 

P(Hq),  7Ti  =  P(H i),  and  Ai,Bi,  i  =  1,2,  defined  in 
(4)  and  (5)  at  the  top  of  next  page. 

If  furthermore  the  fusion  center  uses  the  AND  rule,  we 
have 

Proposition  2  For  the  distributed  test  of  indepen¬ 
dence  problem  with  one  bit  quantization  defined  above, 
if  we  assume  further  that  AND  rule  is  used  at  the  fu¬ 
sion  center,  i.e.,  Uq  =  1  if  and  only  ifU  =  V  =  1,  then 
the  optimal  local  decision  rule  is  given  by: 

{f  P(xj\xiHi)dxf 

1  jf  iBl _  >  £0 

Id,  p(xi\xiHo)dxi  ~  7I’1  (7) 

0  otherwise 

where  Di  =  {xi  :  P(Ui  =  1| xf)  =  1}  is  the  rejection 
region  for  hypothesis  Hq  at  ith  local  sensor. 

For  the  case  of  one  sided  hypothesis  testing  of  inde¬ 
pendence,  e.g.,  Hq  :  p  >  0  versus  Hi  :  p  =  0,  we  have 
the  following  corollary. 

Corollary  1  For  the  distributed  one  sided  hypothesis 
testing  of  independence  problem  with  one  bit  quantiza¬ 
tion  defined  above,  single  semi-infinite  intervals  for  D\ 
and  D2  form  a  PBPO  solution  for  minimum  probability 
of  error. 

The  fact  that  optimal  quantizer  has  semi-infinite  quan¬ 
tization  intervals  is  rather  appealing  as  it  allows  efficient 
search  of  a  single  threshold  for  quantizer  design.  Proof 
of  Propositions  1  and  2  as  well  as  Corollary  1  is  omitted 
due  to  space  limit. 

3  Numerical  examples 

Fig.  1  plots  I(U;V)  as  a  function  of  thresholds  t-\ 
and  t2  for  p  =  0.65.  Apparently  I(U\V)  achieves  its 
maximum  («  0.15)  when  (ti ,  ^2)  —  (0,0).  We  further 
conjecture  that,  this  point  is  actually  a  global  maxi¬ 
mum  which  is  corroborated  by  extensive  numerical  re¬ 
sults.  The  difficulty  in  proving  it’s  global  maximum  is 


P(U0  =  l\Ui  =  0,1 

rJl  =  j)]P(Ul  =  j\x  l) 

(4) 

P(£/o  =  0|J7i  =  l,l 

rA  =  j)]P(Ui  =  j\xd 

(5) 

0.2- 

0.15- 


-10  -10 


Figure  1:  Plot  of  I{U\  V)  as  a  function  of  thresholds  t\ 
and  t-2  for  p  =  0.65. 

that  we  do  not  have  an  analytical  expression  of  the  cu¬ 
mulative  distribution  function  for  a  bivariate  Gaussian 
distributed  random  variables. 

An  interesting  application  of  our  main  result  is  the 
spectrum  sensing  problem  in  cognitive  radio  network, 
where  multiple  secondary  users  collaborate  to  detect 
whether  the  primary  user  is  present  or  not.  While  the 
problem  is  well  understood  when  the  primary  user’s  sig¬ 
nal  is  fully  observed  (possibly  corrupted  by  noise),  it  is 
more  challenging  when  only  a  finite  bits  of  information 
from  each  secondary  user  can  be  communicated  to  a 
decision  maker.  Consider  the  following  simple  model  in 
which  local  sensors  Y\  and  Y2  receive  a  noisy  version  of 
the  original  signal  through  independent  additive  Gaus¬ 
sian  channels. 

Yff  =  Xn  +  IV",  (8) 

Y2n  =  Xn  +  N£,  (9) 

where  Xn  is  a  n  length  samples  of  the  primary  user’s 
signal,  i.e.,  Xn  =  [x\,  x2,  ■  ■  ■ ,  xn]  ^  0  when  the  pri¬ 
mary  user  is  transmitting  (hypothesis  Hq)  and  Xn  = 
[0,0,  •  •  • ,  0]  when  the  primary  user  is  silent  (hypothe¬ 
sis  Hi).  In  this  example,  we  assume  that  X  is  a  zero 
mean  independent  Gaussian  random  process  with  vari¬ 
ance  P ,  for  simplicity,  which  may  be  justified  by  the  use 
of  Gaussian  pulse  shaping  filter  used  in  digital  commu¬ 
nications.  The  noise  N\  and  N2  are  independent  stan¬ 
dard  Gaussian  random  variables. 

Upon  receiving  Yf1 ,  sensor  i  will  send  a  binary  de¬ 
cision  vector  u"  to  the  fusion  center,  the  fusion  center 


will  then  decide  wether  the  original  signal  is  present  or 
not.  Clearly,  if  X  is  present,  the  received  signals  at  lo¬ 
cal  sensors  Y\  and  Y2  are  correlated  (In  the  simulation, 
we  choose  P  =  2.857  to  make  sure  that  the  correlation 
of  Y\  and  Y2  under  H0  is  0.65  ).  We  use  the  following 
decision  rules,  for  k  =  1,  2  and  *  =  1, 2,  ■  ■  ■ ,  n 

uki  =  1  if  lJ kl  >  t  (10) 

VP  +  1 

The  fusion  center  decides  iim  =  1  if  and  only  if  u u  = 
U2i  for  i  =  1,  2,  •  •  • ,  n,  and  then  make  a  final  decision 
using  the  following  majority  rule, 

n 

u=  1  if  '^2u0i>t0(n)  (11) 

i= 1 

where  to(n)  is  chosen  so  that  the  probability  of  type  I  er¬ 
ror  Pei  =  0.1.  Since,  under  Hq,  Y^ii=iu0i  is  a  binomial 
distribution  with  probability  of  success  p  =  Pro(uu  = 
U2i),  which  can  be  easily  calculated,  to(n)  can  be  eas¬ 
ily  evaluated  numerically  for  each  n.  Randomization  is 
used  to  ensure  that  the  false  alarm  probability  to  be 
precisely  0.1  so  as  to  maximize  the  detection  probabil¬ 
ity. 


Figure  2:  Probability  of  error  for  spectrum  sensing. 

Fig.  2  shows  the  performance  of  the  above  algorithm. 
In  the  simulation,  we  assume  that  Pr(Ho)  =  0.8, 
and  we  choose  five  different  local  decision  thresholds 
(t  =  —1.5,  —0.5, 0,  0.5, 1.5)  in  (10).  To  compare  the 
performance,  we  also  plot  the  optimal  error  exponent 
I(U;V)  as  plotted  in  Fig.  1.  We  observe  that  as 
number  of  samples  increases,  the  probability  of  error 
decreases,  and  the  threshold  t  =  0  performs  the  best 
among  others.  Notice  that  the  vertical  axis  is  in  loga¬ 
rithmic  scale  and  the  slope  appears  to  be  equal  to  the 
plotted  I(U;V). 


tion  constraints.  In  particular,  with  one  bit  quantiza¬ 
tion,  we  derived  quantization  rules  for  single  threshold 
quantizer  at  the  local  sensors  that  optimize  the  error  ex¬ 
ponent.  For  distributed  one  sided  independence  testwe 
proved  that  semi-infinite  interval  quantizers  form  a  per¬ 
son  by  person  optimal  (PBPO)  solution  for  minimum 
probability  of  error. 

Appendix  -  Proof  of  Theorem  1 

With  one  bit  scalar  quantization,  optimizing  error  ex¬ 
ponent  is  equivalent  to  maximize  the  mutual  informa¬ 
tion  I(U;V).  Define,  under  H0,  Pij  =  Pr(U  =  i;V  = 
j).  i,j  =  {0, 1},  which  can  be  expressed  in  terms  of 
integration  of  (1)  given  the  single  threshold  quantizer 
assumption.  By  definition, 

Pr{U  =l)  =  Pr{X>ti)  =  Q{t1)  (12) 

Pr{V  =  1)  =  Pr(Y  >  t2 )  =  Q(t2)  (13) 


where  the  Q  function  is  complementary  cumulative  dis¬ 
tribution  function  for  standard  Gaussian  distribution. 
We  want  to  maximize  J(t/;  V),  where 


I{U-V)  =  H{U)  +  H{V)  -  H(U-  V)  (14) 
=  H(Q(ti))  +  H{Q(t2)) 

-H(P00,Poi,Pio,Pii),  (15) 


where  H(-)  is  the  Shannon  entropy  function. 

We  now  compute  the  first  partial  derivative  of 
I(U;V)  with  respect  to  t\  and  £2,  repectively.  We  get, 
with  tedious  but  straightforward  computation, 


dI(U; 

V ) 

1 

exp 

-t2 

ll 

f  1 

Q(ti) 

dti 

2 

\ 

1  -  Q{ti) 

+  [1- 

-Q( 

^2  - 
Vi 

-  ptl 

-  P2 

\  1 1  P bo 

)]  los  -5- 
no 

+Q{ 

£2  — 

a/i- 

ptl 

-  p1 

)log 

Po  1  1 

PiV 

dI(U ; 

V) 

1 

exp 

-4 

{ 

Q(t  2) 

dt2 

\/2t r 

2 

\ 

1  -  Q(t2) 

+  [1- 

-Q( 

tl  - 
V7! 

-  pt'2 

~  P2 

\  1  1  1^30 

)]  log  -b~ 
Gn 

+<2(' 

t\  — 

Vi- 

Ph 
-  P2 

)log 

P10  1 

P\  1  1 

(16) 


(17) 


One  can  easily  check  that  (ti,t2)  =  (0,0)  is  a  critical 
point,  i.e. ,  the  first  partial  derivatives  equal  0.  We  next 
check  its  Hessian  matrix: 


M  = 


a(p)  b(p)  \ 

Hp)  c{p)  J 


(18) 


4  Conclusion 


where 


Ap)  = 


d2I(U;V) 


dt\ 


I(ii,i2)  =  (0,0) 


In  this  paper,  we  studied  distributed  test  of  indepen¬ 
dence  of  bivariate  Gaussian  sources  with  communica- 


Notice  that,  form  Lemmas  2  and  3,  we  only  need  to 
prove  that 


,,  ,  d2I(U;V)t 
dP>  =  dhu — 

,  a  _  d2I(U;V)t 

dP)  -  - fa 2 - l(ti,ta)=(0,0) 

We  want  to  show  that  a(p)  <  0  and  det  M  =  b(p)2  — 
a(p)c(p)  >  0  for  all  p  €  [— 1,  0)  U  (0, 1]. 

We  can  easily  calculate  that, 

a(p)  =  c(p )  (19) 

=  ^[_4  +  7f=!log§7  +  4r^7ll‘® 

Next,  we  introduce  a  lemma  concerning  evaluating 
the  cumulative  distribution  function  of  a  standard  bi¬ 
variate  Gaussian  distribution  at  point  (0,0). 

Lemma  1  [9,  page  290] 

Poo(ti  =  t2  =  0)  =  Pn(ii  =  t2  =  0)  =  \  arcsin(p) 

4  Z7T 

Pm{ti  =  t2  =  0)  =  Pi0(ii  =*2  =  0)  =  ^--^-  arcsin(p) 

4  Z7T 

Using  (22)  and  (23),  we  can  further  get 

.  .  ,  .  1  .  ,  2 p  ,  7T  —  2  arcsin  p 

a(p)  =  c(p)  =  —  -4  +  log - - - ; — — 

27t  _  p2  7T  +  2  arcsm  p 


'  9  A  •  2  j  V  / 

7rz  —  4  arcsm  p 

, ,  .  1  r  2  ,  7T  +  2  arcsin  p 

b(p)  =  —  — ,  log - - - ; 

2ir  ^/i  —  p2  &  7r  —  2  arcsin  p 

87t  arcsin  p 

2  A  ^  (25) 

7rz  —  4  arcsm  p 

Next,  we  want  to  evaluate  functions  a(p),b(p)  and 
c(p)  with  the  help  of  the  following  two  lemmas. 

Lemma  2  For  a(p )  and  c{p)  defined  above,  we  have: 

a(p)  =  c(p)  <  0  (26) 

for  all  p  £  [—1,1]  and  the  maximum  is  achieved  when 

P  =  o. 

Lemma  3  For  the  function  b{p)  defined  above,  we 
have: 

b{p)>  0,  if  p  £  (0,1]  (27) 

b(p)  <  0,  if  p  S  [-1,0)  (28) 

b{P)  =  0,  if  p=  0  (29) 


~a(p)  >  b(p)  if  p  £  (0, 1]  (30) 

a(p)  <  b{p)  if  [-1,0)  (31) 

Define,  d(p)  =  —  a(p)  —  b(p)  and  e(p)  =  a(p)  —  b(p). 

We  want  to  show  that 

d{p)  >  0  if  p  £  (0, 1]  (32) 

e{p)  <  0  if  p  £  [-1,0)  (33) 

This  can  be  verified  by  noting  that 


1  .  „  1  —  p  ,  7T  +  2  arcsm  p 

—  —2<  / - log - 

2-7T  \  1  +  p  7T  —  2  arcsin  p 

8(7r  —  2  arcsin  p)  arcsin  p1 

9  i  :  2  > 

7rz  —  4  arcsm  p 

1  rr»  /I  +  p  .  7T  —  2 arcsin p 

—  [2  4  / - log - 

!  7r  V  1-p  7T  +  2  arcsin  p 

8(7r  +  2  arcsin  p)  arcsin  p1 

9  i  :  2  t 

7rz  —  4  arcsm  p 
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Form  Lemma  2,  we  can  see  that  a{p)  <  0  for  all  [9]  H.  Cramer,  Mathematical  Methods  of  Statistics, 
p  £  [-1, 0)  U  (0, 1]  is  satisfied.  Next,  we  want  to  prove  Princeton  University  Press,  1946. 
that  b2(p)  —  a(p)c(p )  <  0  for  all  p  ^  0  is  also  true. 


